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ABSTRACT 

We demonstrate that optical data from SDSS, X-ray data from ROSAT and Chandra, and SZ 
data from Planck, can be modeled in a fully self-consistent manner. After accounting for systematic 
errors and allowing for property covariance, we find that scaling relations derived from optical and 
X-ray selected cluster samples are consistent with one another. Moreover, these clusters scaling 
relations satisfy several non-trivial spatial abundance constraints and closure relations. Given the 
good agreement between optical and X-ray samples, we combine the two and derive a joint set of 
Lx~M and Ysz~M relations. Our best fit Ysz~M relation is in good agreement with the observed 
amplitude of the thermal SZ power spectrum for a WMAP7 cosmology, and is consistent with the 
masses for the two CLASH galaxy clusters published thus far. We predict the halo masses of the 
remaining z < 0.4 CLASH clusters, and use our scaling relations to compare our results with a variety 
of X-ray and weak lensing cluster masses from the literature. 
Subject headings: cosmology: clusters 



1. INTRODUCTION 

Cluster scaling relations arc of fundamental impor- 
tance in both cosmological and astrophysical contexts. 
From the cosmological point of view, galaxy clusters are 
potentially the mos t precise probe of structure growth 
available today (e.g.lCunha et al.H2009b lOguri fc Takadal 
120101 : iWeinberg et all 120121 ). Given a cosmological 
model, robust theoretical predictions for the spatial 
abundance of massive halos as a function of mass (e.g. 
Jenkins et aHl2001|[Shetb. fc Tormen1l2002l: iWarren et all 
20061: iTinker et all 120081 : iBhattacharva et all 120111 ) are 



combined with cluster scaling relations to model the 
space-time abundance of galaxy clusters as a function of 
their observable properties (e.g., X-ray luminosity, Lx, 
Sunyacv-Zel'dovich (SZ) signal, Y"sz, optical richness, 
-^200 )■ Such an approach has enabled recent cosmolog- 
ical constraints from cluster survey counts at a variety 
of wavelengths (e.g. | Henrv et all 120091: IVikhlinin et al l 
' 20091: iMantz et alpOlOal: IRozo et alll2010l : lBenson et all 
20lTI: ISehgal et alll201lD . 
From an astr o physic al perspective, the self-similar 



model of iKaiserl ([1986D makes concrete predictions for 
how cluster scaling relations should behave in the ab- 
sence of non-gravitational physical processes. The spe- 
cific predictions of this model were quickly shown to be a 



1 Einstein Fellow, Department of Astronomy & Astrophysics, 
The University of Chicago, Chicago, IL 60637. 

2 Kavli Institute for Cosmological Physics, Chicago, IL 60637. 

3 APC, AstroParticule et Cosmologie, Univcrsite Paris 
Diderot, CNRS/IN2P3, CEA/lrfu, Observatoire de Paris, Sor- 
bonnc Paris Cite, 10, rue Alice Domon et Leonie Duquet, Paris 
Cedex 13, France. 

4 Jet Propulsion Laboratory, California Institute of Technol- 
ogy, 4800 Oak Grove Drive, Pasadena, CA, U.S.A. 

5 Departments of Physics and Astronomy and Michigan Cen- 
ter for Theoretical Physics, University of Michigan, Ann Arbor, 
MI 48109. 

6 SLAC National Accelerator Laboratory, Mcnlo Park, CA 
94025. 

7 Lawrence Berkeley National Laboratory, Berkeley, CA 
94720. 



poor match to the local X-ray cluster luminosity function 
(jEvrard fc Henrvlll99lHKaiserlll9911 ). but the underlying 
notion that massive halos are nearly self-similar remains 
well s upported by curr ent data and numerical simula- 
tions (jAllen et al.l 1201 ll and references therein.). 

The intrinsic scaling behavior of massive halos across 
space-time is formally a theoretical construct, indepen- 
dent of observations. That is, given two quantities ip and 
X, the conditional probability distribution P(lmp\x, z) is 
independent of cluster selection by definition. Of course, 
in practice, the distribution of clusters in the ip~x plane 
of any given cluster sample must necessarily reflect not 
only the intrinsic probability distribution P(ln-0|x, z), 
but also how the clusters were selected in the first place. 
For instance, if a x-selected sample is complete above 
some threshold Xmin, the for any x-bin with x > Xmin, 
the sample mean of ln-0 is an unbiased estimator of 
(ln-0|x). By contrast, the sample mean of ln% for galaxy 
clusters in a given ip-bm is not an unbiased estimator 
of (lnx|?/>); relative to a complete sample of ip selected 
clusters, the Xmin limit removes systems with x < Xmin- 
Consequently, this selection effects must be explicitly ac- 
counted for when estimating (hiip\x)- The job of ob- 
servers is to model these selection effects, relating the ob- 
served distribution back to selection-independent quanti- 
ties like P(hiip\x, z). In so doing, one can fairly compare 
the results from different cluster samples, regardless of 
how they are selected @ 

Here, we will define the ip—x scaling relation to be the 
proability distribution P(hiijj\x, z). This choice of def- 
inition implies that, in the absence of systematics, the 
scaling relations estimated from optical, X-ray, and SZ 
cluster samples must necessarily be consistent with one 
another. Any tension them must necessarily reflect either 

8 In a cosmological context, one can forward model any selection 
effects into the observed distribution of galaxy clusters in the tp—x 
space. However, the nuisance parameters should be those charac- 
terizing P(lnip\x, z) if one is to be able to compare results from the 
different cluster catalogs, and/or relate these nuisance parameters 
to different astrophysical processes. 
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systematic errors in the data itself, or in the theoretical 
models used to relate the intrinsic probability distribu- 
tion P(lmp\x, z) to the observed cluster population. In 
this context, we aim to resolve the tension generated by 
the results of lPlanck Collaboration! (|2011d ) — henceforth 
referred to as P ll-opt — on the SZ signal of maxBCG 
galaxy clusters (|Koester et al.|[2007|) . 

Pll-opt predicted the Isz-A^oo scaling relation for 
the maxBCG syste ms using an X-ray model from 
Arnaud et al.l (12010T) and the scaling relations from 
Rozo et all (|2009al ). henceforth referred to as R09. They 
found that their predicted lsz _ -^2oo relation was much 
higher than the observed signal. We provide an expla- 
nation for the Pll-opt results that involves both types 
of errors noted above: unaccounted systematic errors in 
the optical data, the X-ray data, and the SZ data, and 
an incorrect model employed by Pll-opt to derive Ysz 
expectations under A^oo selection. 

Here, we construct a self-consistent model of cluster 
scaling relations that can adequately explain the Pll- 
opt data. Because the model used to predict the Y$z~ 
N200 scaling relation also involves X-ray data, we con- 
sider not just the lsz _ ^2oo scaling relation, but all SZ, 
Lx, and optical scaling relations that have been directly 
measured. We explicitly test whether cluster scaling re- 
lations derived from optical, X-ray, and SZ catalogs are 
consistent with each other, and we demand that our scal- 
ing relations be consistent with cosmological expecta- 
tions for the currentl y favored flat ACDM cosmological 
model from WMAP7 ([Komatsu et al.H201lD . We further 
demand that the spatial abundance functions in each of 
the observables be consistent with each other. 

This set of tests and self-consistency checks is necessary 
to resolve the problem posed by the Pll-opt data. For 
instance, consider a basic interpretation of the Pll-opt 
result as indicating that the mass scale of either optical 
galaxy clusters or X-ray clusters (or both) is incorrect. 
Any changes to the mass scales of these objects neces- 
sarily alters the predicted spatial abundances within a 
WMAP7 cosmology, so there is a very real risk that 
"fixing" the Isz-A^oo relation this way could "break" 
the predicted cosmological counts. Similarly, such a fix 
would affect other scalings, e.g., Lx-A^oo, Lx~M, etc., 
and the true multivariate model must be internally con- 
sistent. Broadly speaking, we must be able to close all 
possible loops in this multi-dimensional parameter space. 
The fact that scaling relations and cluster counts are 
so interconnected offers the potential to obtain precise 
constraints on multiv ariate scalings and cosmology from 
joint sample analysis dCunhal [20091 ). 

This is the third and final paper of a series that intends 
to explain the tension originally pointed out by Pll-opt. 
In lRozo et al.l ()2012cl hereafter paper I), we use pairs of 
clusters common to independent X-ray samples to mea- 
sure systematic uncertainties i n derived X - ray pro perties 
and inferred halo masses. In IRozo et all (|2012bl paper 
II), we use the results of paper I to characterize the im- 
pact of these systematic errors on X-ray cluster scaling 
relations. Combined, papers I and II develop the neces- 
sary foundations for our treatment of the lsz _ A^2oo scal- 
ing relation, the main aim of this paper. 

The paper is organized as follows. In £[3J we briefly 
summarize the various data sets we employ. In section 
[3] we consider the Fsz-A^oo relation, and discuss what 



is necessary to "fix" it. The remainder of the paper is 
focused on checking whether our fix of the Vsz-A^oo re- 
lation is consistent with all other data connected to it. 
In $4] we consider whether our modified scaling relations 
and mass calibrations are consistent with cosmological 
expectations, and whether the optical and X-ray cluster 
spatial abundances are consistent with one another. Sec- 
tion [5] compares the Lx~M and Y$z-M scaling relations 
derived from optical and X-ray selected cluster catalogs. 
In addition, we also compare the amplitude of the ther- 
mal SZ power spectrum predicted from our best fit Ysz - 
M scaling relation to measureme nts from the South Pole 
Telescope (Reichar dt et al.ll201lD . In we test the self- 
consistency of our scaling relations, testing whether we 
can combine the lsz~A 7 2oo aud ix _ A^2oo scaling relation 
to derive the observed Y§z~Lx scaling relation. In fJ7]wc 
take a closer look at the maxBCG mass calibration, de- 
rive our set of preferred scaling relations by combining 
maxBCG and V09 scaling relations, and then use these 
to predict masses for each of the CLASH systems with 
z < 0.4. We also compare our predicted masses with 
a broad range of works from the literature, and predict 
the amplitude of the thermal Sunyaev-Zeldovich power 
spectrum, and compare it with observations. Section [8] 
presents a summary and discussion of our results. 

When considering cosmological predictions 
for cluster spatial abundances, we adopt the 
best fit WMAPT+BAO+ffn fla t ACDM cosmol- 
ogy from iKomatsu et~aTl (|2011f ): cr 8 = 0.816, 
Ho = 70.2 km/s/Mpc, fl m = 0.274, and n s = 0.968. 
Cluster scaling relations were estimated using flat 
ACDM models with h = 0.7, though there is some 
variance in the value of fi m between various works, with 
TO G [0.27, 0.3] depending on the work. The impact of 
this level of variation in the recovered cluster scaling 
relations is < 5%. The measurement conventions we 
employ are given in Q2.2\ 

2. DATA SETS AND CONVENTIONS 

The data employed in this work is collated from a va- 
riety of papers cited in Papers I and II, and we direct 
the reader to these papers for details. We provide a brief 
summary in this section. 

A note regarding mass is in order. Throughout, we 
take halo mass, M, to be M5000 the mass defined within 
an overdensity of 500 with respect to the critical den- 
sity at the redshift of the cluster. While the true mass is 
not strictly observable, it can be inferred from observable 
properties in a model-dependent manner. The published 
X-ray samples we employ calibrate scaling relations using 
a subset of clusters with hydrostatic mass measurements, 
then derive mass estimates for the entire sample based on 
the implied scaling relation. Thus, our use of mass, M, 
should generally be interpreted as an observable rescaled 
to provide an estimate of M 50 o c (see Paper I). Statisti- 
cal inferences are unaffected by this complication, as the 
Msqoc estimates so derived are, by definition, statistically 
consistent with pure hydrostatic masses. 

2.1. Data Sets 

Our work with optically selected galaxy clusters is 
based on the Sl oan Digital Sky Sur vey (SDSS) maxBCG 
cluster catalog (|Koester et al.ll20"07l) . The optical observ- 
able is the red-sequence count of galaxies, also known as 
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optical richness, A^oo- The L x-N 2 qq and M-N 2 p o scal- 
ing relations are those from IRozo et all (|2009aL R09), 
who applied various correctio ns to the or i ginal s et of 
scaling relation s repo rted in iRvkoff et al.l (|2008f ) and 
iJohnston et all (|2007f ). The Isz-^nn scaling relatio n 
is that from the Pll-opt (jPlanck Collaboration! 1201 Id ), 
corrected for the effects of cluster miscen tering, as as 
per the results of iBiesiadzinski et al.l (|2012f ). and for the 
expected level of aperture-induced measurement bias in 
the data (see fj3j) - These corrections are applied on the 
binned Ygz data and scaling relations quoted in Pll-opt, 
and do not employ the raw Planck data in any way. 

Turning now to X-ray selected syste ms, most of our 
work is based on the data presented in IVikhlinin et al.l 
(|2009ft . henceforth referred to simply as V09. The V09 
galaxy clusters are X-ray selected, and the cluster masses 
arc estimated based on the M-Yx scaling relation, which 
is itself cal ibrated using hy d rostat ic mass estimates as de- 
scribed in IVikhlinin et al.l (|2006| ) . These cluster masses 
are used to estimate the Lx~M scaling relation, while 
the Ysz~M scaling relations is derived from the M-Yx 
relation above with the Y^z -Yx for the V09 clusters as 
quoted in lRozo et al.l ()2012al ). The derivation of the Ysz - 
M and Y§z~Lx scaling relation for the V09 data set is 
presented in paper II. 

In section EH w e also consider the X-ray data 
from iMantz et al.l (l2010bl). as well as that of 
Planck Collaboration! (12011b! ) (henc ef orth Pll) 



Arnaud et all ()201 



ion| (I2011bh 

iTH, and iPratt elaTl (poM ). 



all cases, mass calibration is obtained using hydro- 
static mass estimates (e.g. iPointecouteau et al.l 120051 : 
lArnaud et all 120071; I Allen et al.ll2008D . Concerning the 
Pll data, Pll distinguish between A and B systems on 
the basis of the apparent angular size of the cluster in 
the sky. All of the galaxy clusters in the redshift range 
of interest (z w 0.2 to match the maxBCG catalog) 
are A clusters. In paper I, we noted the X-ray data of 
clusters A and B are systematically different. Since we 
are interested in the comparison to maxBCG galaxy 
clusters, we follow our work on papers I and II, and 
focus exclusively on the Pll data for systems in the 
redshift slice z G [0.12,0.3], and use Pll(z=0.23) to 
denote this cluster subsample. We refer the reader to 
papers I and II for further details. 

Turning to the Y$z~Lx scaling relation, we consider 
data derived both from the Pll(z =0.23) data set, as well 
as the measurement reported in iPlanck Collaboration! 
(|2011af) . which we denote Pll-X. Pll-X estimates 
the Ysz-Lx scaling relation by stacking X-ray se- 
lected clusters from the compilation MCXC catalog 
(jPiffaretti et al.ll201~TI ). 

2.2. Conventions 

The optical richness, A2001 measures the number of 
galaxies within a g — r color-cut around the color of the 
galaxy designated as the brightest cluster galaxy, and 
within a scaled radial aperture^ X-ray luminosity, Lx, 
is defined as the rest-frame [0.1, 2.4] keV band luminosity 
within a cylindrical aperture of i?5oo- V09 has a some- 
what different definition, as does R09, but we correct 

9 Despite the subscript, this aperture is not a good estimator 
for the -R200 °f the cluster, and should only be considered as an 
optical "tag" that correlated with mass. 



for these differences (Paper I). We quote luminosity in 
units of 10 44 ergs/s. Finally, Y"sz is defined as the inte- 
grated SZ signal within the cluster radius i?5oo- In some 
cases, such as the analysis in Pll-opt, the aperture -R500 
used to estimate Ysz explicitly depends on the richness 
N200, which results in modest aperture- induced correc- 
tions. This is discussed in more detail in Section [31 and 
is also relevant to the discussion in section [5] Because 
the scaling relation lsz _ A^2oo explicitly depends on the 
angular diameter distance, we always quote scaling rela- 
tions with respect to D^Ygz rather than Ysz alone. For 
instance, we quote the amplitudes for the scaling rela- 
tions D\Y SZ -N and D\Y S z-Lx rather than Ysz-A^oo 
or Y§z~Lx- However, for brevity, we will refer to these 
D\ weighted relations as the Ysz-A^oo an< ^ ^sz~Lx scal- 
ing relations: the D 2 A factor will often be implied. In all 

cases, we measure D\Y$z in units of the 10 -5 Mpc 2 . 

Turning to scaling relations, given two arbitrary cluster 
observables ip and Xi wc assume the probability distri- 
bution P(ijj\x) is a log-normal distribution. As in paper 
II, we model the mean of this distribution as a linear 
relation in log-space, so that 

(\nip\x) = a^ x + a^ x ln(x/xo) (1) 

where clm x is the amplitude parameter, and oi^,\ x is the 
slope. The parameter xo is the pivot point of the relation, 
which wc always select so as to decorrelate the amplitude 
and slope parameters. The variance in Imp at fixed x is 
assumed to be constant, and is denoted as 



Var(lnV'lx) = 



2 



(2) 



We note that binned data naturally measures the mo- 
ment {tp\x) rather than (ln^|x). The two are related 



1 



ln((^|x)) = (lnV|x) + r4 



(3) 



We define the value of In (tp\x) &t the pivot of the 

scaling relation, so that 



a-0 



(4) 



We note that if the scatter, o~m x , is correlated with 
either the amplitude, clm x , or slope, a$\ x , of the scaling 
relation, as it often is, then a pivot point that decorre- 
lates o,m x and a^\ x need not decorrelate the mean linear 
amplitude, a^\ x , and slope, at^\ x - We explicitly take into 
account the difference between clm x and a,M x in all of the 
comparisons performed in this paper. 

To avoid any possible confusion, the subscripts we em- 
ploy in this work are as follows: m for mass, n for N200, 
x for Lx, and sz for D^Ysz- So, for instance, a sz \ m is 
the amplitude of the D\Ysz~M relation. 

3. THE Y sz -N 200 SCALING RELATION 

3.1. Data and Model Predictions 

We begin our investigation by exploring the Ysz _ 
iV 2 oo scaling relation measured by Pll-opt. The data 
is taken directly from that work, and is corrected for 
the effects of clust e r mis centering as per the results o f 
IBiesiadzinski et al.l (|2012l ) (see also lAngulo et alj|2012l) . 
The centering correction applied is summarized in Table 



4 



Rozo et al. 



TABLE 1 
VSZ-JV200 Data 



N200 D\Y S z/{W~ 5 Mpc~ 2 ) Centering Correction 



10- 


13 


0.058± 0.012 


0.74± 0.13 


14- 


17 


0.107± 0.020 


0.77± 0.11 


18 


24 


0.222± 0.028 


0.80± 0.08 


25- 


32 


0.394± 0.044 


0.82± 0.08 


33- 


13 


0.692± 0.074 


0.84± 0.07 


11 


58 


1.205± 0.130 


0.86± 0.07 


59 


77 


1.876± 0.241 


0.87± 0.07 


78 


104 


4.594± 1.009 


0.89± 0.09 



a The data in t he fir st two columns is from 
Planck Collaboration (2011c, Pll-opt), after being cor- 
rected for the effects o f cluster miscentering following 
Bicsiadzinski et al. (2012) (third column). The uncertainty 
in the corrections is added in quadrature to the observa- 
tional errors. 



[U along with the corrected data points. We have also 
added in quadrature the uncertainty associated with the 
miscentering corrections to the error budget of Pll-opt. 
We note that the errors in Pll-opt are in fact asymmet- 
rical, but we have opted to symmetrize them to simplify 
our analysis. Finally, we also reduce the observed ampli- 
tude by 0.05 to account for the Malmquist bias caused 
by aperture- induced covariance in the Ysz measurements 
(see below). 

We compare the Pll-opt data to the predicted Ysz - 
N200 relation obtained from combining the M-N200 scal- 
ing relation from R09, and the Y§z~M relations for the 
V09, M10, and Pll(z=0.23) data sets derived in paper 
II. These input scaling relations arc summarized in Ta- 
ble [3J along with the predicted Ysz~N 2 oo relations. We 
note that R09 report not the amplitude parameter a m \ n , 
but rather the parameter a m \ n that characterizes the 
mean (MlA^no)- A similar ca veat holds for Ysz--ZV 2 oo, as 
iPlanck Collaboration! (|2011d ) compute (lszl-^200) rather 
than (lnYgzl^oo)- In this section, we will only con- 
sider the statistical uncertainty in the amplitude of the 
M-N200 relation; the systematic error will be considered 
independently below. 

We use the formalism from Paper II to predict the Ysz - 
A^oo scaling relation from these input scaling relations. 
The amplitude and slope are given by 



|m^m|n; (6) 



^sz\n 

^sz\n (%sz\m^m\ni 



where r = r S z,n\m is the correlation coefficient of Ysz 
and -/V200 at fixed halo mass, and — (3 is the local loga- 
rithmic slope of the cosmic mass function. The scatter 
in the mass-richness relation <7 m i„ is large (0.45 ± 0.1), 
primarily reflecting a poor choice of richnes s estimator 
rather than a large intrins ic cluster variance (R ozo et al.l 
I2009bt iRvkoff et al.llMl) . Consequently, we do not ex- 
pect the richness and SZ scatter to be strongly correlated 
at fixed mass. For r SZin \ m = 0.1, (3 = 3, o m \ sz = 0.45 
and <J m \sz =0.1, we find that the amplitude correlation 
term above is « 2%, much too small to be of relevance 
for this study. 

On the other hand, in a recent work, lAngulo et al] 
(|2012t) used the Millenium-XXL simulation to estimate 
the correlation coefficient between Ygz and N200, finding 
a value of r = 0.47. We expect the large correlation 



coefficient of lAngulo et al] (|2012D reflects aperture ef- 
fects. Specifically, we wish to predic t YSZ--/V200 when Ysz 
is measured within -R500, whereas lAngulo et al] (|2012l ) 
explicitly measure Ysz within the radius (-R500 1^200 }• 
We can estimate th e correlation coefficient quoted in 
lAngulo et al.l (|2012h by noting that the scatter of the 
Ysz~M scaling relation from V09 is cr sz \ m = 0.12. We 
also know from paper I that a bias b m in the observed 
mass induces a bias of ~ 0.31n6 TO in In Ysz due to 
aperture effects. Since the optical scatter in mass is 
0Yn|n ~ 0.4, the induced scatter in Ysz via aperture ef- 

fects is (S 2 sz mduced ) 1/2 « 0.3 x 0.4 « 0.1. This scatter is 

perfectly correlated with richness, while the total scatter 
is the intrinsic scatter plus the induced scatter added in 
quadrature. The resulting, aperture-induced correlation 
coefficient is 



r = ■ 



{(&sz,int ~t~ & s z ,induced 



a 



s zlrn yj a. 111 



0.1 x 0.4 



(0.1 2 + 0.12 2 ) 1 /2 x0.4 



0.6, 



(7) 



in reasonable agre ement with r = 0.47 quoted in 
lAngulo et al.l ([2012D . The total scatter (intrinsic + in- 
duced) is ~ 0.16, also in reasonable agreement with 
lAngulo et all (|2012t ) who find a sz \ m = 0.18. 

So which correlation coefficient should one adopt? If 
one wishes to predict Ysz-A^oo where Isz is measured 
within -R500, as we do, then one should set r 



sz,n\m 



since Ysz within -R500 is not subject to aperture effects. 
That said, when we compare our predictions to the Pll- 
opt data, we need to correct the Pll-opt data for the 
aperture-induced effect estimated above. Thus, even 
though we set r sz n \ rn = in our analysis, our com- 
parison to data explicitly takes into account the impact 
of aperture- induced covariance. We note, however, that 
the fact that the Pll-opt measurements are template- 
amplitude fits rather than cylindrically integrated Ysz 
measurements will reduce the impact of said biases, as 
inner radii acquire higher statistical weight than in the 
case of a cylindrical integration. The naive bias estimate 
of rpa sz \ m a m \ sz <j m \ n is w 0.1 for r = 0.5, which sets an 
upper limit to the correction appropriate in the case of 
template amplitude fits. For our purposes, when compar- 
ing the Pll-opt data to our predictions, we will decrease 
the Pll-opt data by half of this amount, and note that 
this correction has a ±5% systematic uncertainty in the 
amplitude of the Ysz~M relationFl 
The scatter of the Ysz-A^oo relation is given by 



t7 ' sz\n a sz\m 



®m\sz ^sz,n\m^m\n^7n\sz 



(8) 



We follow the same procedure as in paper II to estimate 

10 There are some additional secondary effects that should re- 
du ce the correlat i on co efficient between Ygz and iV 200 relative to 
the lAng ulo et al. (2012) measurement. In particular. lAngulo et all 
l|2012l ) base their predictions on projecting the galaxy density field 
across the full simulation box, which is 4.11 Gpc. This is to be 
compared to the comoving width of the red-s equence, which is 
~ 0.1 Gpc. In addition, the Angulo et al. (2012) measurement in- 
cludes miscentering induced covariance, whereas we are explicitly 
correcting the data for miscentering. To the extent that the dom- 
inant effect are the aperture-induced corrections, however, these 
differences should play a minor role. 
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TABLE 2 

Input (Ygz-M and M-JV200) AND Derived (Ysz-^oo) Scaling Relations at z = 0.23 



Relation 


X0 


P 




a 


^ln ip | x 


Sample 


D 2 .Y SZ -M 


4.8 


2.75 


1.34±0.07 a 


1.61 ±0.11 


0.12 ±0.03 


V09 


D\Y SZ -M 


5.5 


2.93 


1.97 ±0.06 


1.48 ±0.12 


0.20 ± 0.04 


Pll(z=0.23) 


D\Y SZ -M 


10.0 


3.95 


2.54 ±0.20 


1.48 ± 0.09 


0.15 ±0.03 


M10 


M-N200 


10 




0.95 ± 0.07^ 


1.06 ± 0.08 (stat) ±0.08 (sys) 


0.45 ±0.1 


maxBCG 


D\Y S z-N 20Q ° 


70.0 




1.13 ±0.17 


1.70 ±0.18 


0.78 ±0.19 


V09 


D 2 A Y SZ -N 200 T 


70.0 




1.57 ±0.16 


1.57 ±0.18 


0.74 ±0.18 


Pll(z=0.23) 


D\Y sz -N 20Q b 


70.0 




1.26 ±0.26 


1.57 ±0.16 


0.73 ±0.17 


M10 



a We assume the form (\nip\x) = lnV>0 ± am (x/xo)- Units are 10 14 Mq for mass, and 10" 5 Mpc 2 for D^Vsz- The 
quantity ft is the slope of the halo mass function at the pivot scale of the Yg z -M relation. The Ys z -N 2 oo relations 
are derived by combining each Yq z -M relation with the R09 M-N200 relation. 

b The systematic error in the amplitudes (~ ±0.1) are considered independently. 

c We emphasize that even though we plot In (D^Ygz \N200) so as to match the Pll-opt data, the amplitude reported 



here is a„ 



rather than 5, 



(see text). 



the 68% confidence region of the Ysz-A^oo scaling rela- 
tion parameters: we randomly sample the M-N200 and 
^SZ _ ^2oo parameters from the appropriate priors, and 
use the above equations to compute the distribution of 
the corresponding Ysz-A^oo parameters. The predicted 
scaling relations for each of the three data sets under con- 
sideration — V09, M10, and Pll(z=0.23) — are summa- 
rized in Table El 

As this work was being completed, iNoh fe CohrJ 
(|2012t) published another study on observable covari- 
ancc. From their table 2, we compute a correlation co- 
efficient r sz n \ m sa 0.4 in reasonable agreement with the 
lAngulo et al.l (|2012f) value, and possibly at odds with 
our interpretation of the latter value as due primar- 
ily to aperture-induced covariance. If the covariancc 
in the cluster observablcs if fully dominated by the lo- 
cal clust er triaxiality and/or local filamentary structure 
(see e.g. IWhite et al.l[2010l: INoh fe Cohdl20ll. the n the 
good agreem e nt be tween the lAngulo et al.l ( 20121 ) and 
INoh fc Cohnl (|2012h would be expected. Regardless of 
the origin of the covariance, we emphasize that our com- 
parison to the maxBCG data explicitly incorporates the 
impact of such covariance as described above. 

3.2. Results 

The top panel in Figure Q] compares the lsz _ -^2oo re ~ 
lation predicted using the V09, M10, and Pll(z=0.23) 
scaling relations to the Pll-opt data. Note in Figure 
rjwe plot ln(£^Y S z|JV20o) rather than (lnD^Fszl^oo) 
to match the Pll-opt data. The bottom panel shows 
this same data after subtracting the (iszlA^oo) fit re- 
ported in Pll-opt (the points do not scatter about zero 
because of the miscentering and aperture-induced bias 
corrections applied). There is reasonable agreement be- 
tween the Planck-m&xBCG data and the V09 and M10 
models at high masses (M >5x 10 14 Mq), but all scal- 
ing relation predictions fail at low masses. Note that 
because the pivot point of the predicted scaling relations 
is -/V200 = 70, the offset shown in Figure Q] is not simply 
a difference in slope: there is a significant difference in 
amplitude as well. 

We emphasize that the uncertainty in our model pre- 
dictions are comparable to or larger than the errors in 
the Pll-opt data. Moreover, these theoretical uncertain- 



ties are very strongly correlated, so x 2 -by-eye is grossly 
misleading. For instance, reducing the amplitude of our 
V09 model prediction by 3er results in good agreement 
between the Pll-opt data and our model prediction. In- 
deed, the tension between the V09+R09 prediction and 
the Pll-opt data is just under 3cr. 

We make this quantitative by computing x 2 , adding in 
quadrature the observational errors from Pll-opt to the 
covariance matrix for the V09 predictions. We evaluate 
goodness of fit by generating 10 5 Monte Carlo realiza- 
tions of the data using the full covariance matrix, and 
empirically compute P(x 2 > Xobs)- The corresponding 
probabilities for each of the three models we consider are 
P = 0.005 (2.8cr, V09), P < 10- 5 (> 4.4cr,Pll(z=0.23)), 
and P = 0.06 (1.9a, M10). Evidently, the tension is not 
anywhere nearly as strong as it appears by eye, though 
the Pll(z=0.23) model is ruled out at high significance. 
If we used only the diagonal terms in the full covariance 
matrix, we find Xdiag = 71, which demonstrates how 
grossly misleading "x 2 -by-eye" estimates can be. 

Reconciling the lsz — A200 data with our predictions 
requires that at least one of the input scaling relations 
that were used in our predictions be incorrect. R09 al- 
lows for a 10% systematic error offset in the weak lens- 
ing masses used to construct the M-N200 relation, but 
lowering the cluster masses by said amount is not suffi- 
cient to remove the tension from the data. If R09 have 
not underestimated their systematic uncertainty, the re- 
maining discrepancy would have to be in the X-ray mass 
estimates. Because all X-ray works relied on hydrostatic 
mass calibration, the most obvious source of bias is non- 
thermal pressure support in galaxy clusters. We con- 
sider a fiducial model in which there is 15% hydrostatic 
bias, meaning hydrostatic masses arc 15% lower than the 
true cluster masses when compared at a fixed aperture. 
This value is typical of what is pred icted from numer 
ical simulations (» 10% — 25%, e.g . iNagai et al.l 
Lau et al.ll2009tlBattaglia et al.ll201~l1 iNelson et al.l 



Rasia et al.ll2012l 



2007 



2011 



Because cluster masses are defined using a constant 
overdensity criteria, the bias between the true and re- 
ported M500 values is large r than the mass bias at fixed 
aperture. For an NFW (|Navarro etai] Il99ll profile 
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Fig. 1. — Top panel: Comparison of the predicted (Yszl-^oo) 
relation for each o f the data sets consid ered in thi s work (V09, 
M10, Pll) to the IPlanck Collaboration! (Pll-opt, I2011d) data, 
as labelled. All bands cover the 68% confidence regions for 
ea ch individual scalin g relat ion. The miscentering corrections 
of Bicsiadzinski ct al. (2012) have been applied on the data 
drawn from Pll-opt. We have also applied the aperture-induced 
Malmquist bias corrections (see text). Bottom Panel: As top panel, 
but after subtracting the (Yjjz 1-^200 ) fit from Pll-opt. The data 
points do not scatter about zero because of the corrections ap- 
plied. We emphasize that the uncertainty in the model predictions 
are strongly correlated, so x 2_ by-eye is grossly misleading. For 
instance, lowering the predicted amplitude for the V09 model by 
Rj 3cr results in good agreement with the data, reflecting the fact 
that the tension between the Pll-opt data and the V09 model is 
only 2.8<T. 

with R 5aQ /R s = 3 (e.g. IVikhlinin etHI [2006h . we find 
M(R) oc R 088 at R = R 500 - Assuming M obs (R) = 
bMtrue(R), and expanding in a power- law around -R500, 
we find that the bias in the reported M500 mass is 
M$$/MtJoo e = b 1A , or 21% in our case. This, then, 
is the total mass bias we ascribe to the X-ray masses. 

The top-left panel in Figure [5] compares the predicted 
scaling relations with the Pll-opt data after including 
these two corrections, i.e. lowering the masses of R09 by 
10% while raising X-ray masses by 21%. Possible ori- 
gins of the 10% downwards shift of the maxBCG masses 
relative to R09 will be discussed in section [5] For the 
time being, we just wish to investigate whether the scal- 
ing relations modified in this way provide a good de- 
scription of the data. As shown in Figure [2] the V09 



and M10 models result in excellent fits to the data, 
with P(x 2 > xlbs) = 0-83 and 0.52 respectively. The 
Pll(z=0.23) model has P{\ 2 > X 2 obs ) = 0-01, and re- 
mains in tension with the data at the 2.5a level. 

We have also considered what happens if we increase 
the bias in either the weak lensing masses or the X-ray 
masses. In either case, increasing the mass bias by up to 
ps 20% results in good agreement between the predicted 
and observed Ysz-A^oo relations. Note, however, that 
the absence of any is ruled out. Consequently, we can 
think of the Pll-opt as placing a lower limit to the hy- 
drostatic and weak lensing bias in the data, while placing 
only a very weak upper limit on these biases. 

We again emphasize that x 2 -by-eye can be very mis- 
leading, and that a better sense of the agreement or ten- 
sion between the Pll-opt data and our predictions can 
be obtained by comparing the observed amplitude, slope, 
and scatter of the isz-^oo relation to our predictions. 
This comparison is shown in the remaining panels of Fig- 
ure [2] We focus on the V09 measurements alone to avoid 
overcrowding the figure. We take a sz \ n = 0.7 ± 0.15 as 
the scatter value from Pll-opt, which is broadly consis- 
tent with their Figure 4 across a large richness range. 
For this comparison, we have refitted the Pll-opt data 
after miscentering corrections. We note that the ampli- 
tude parameter plotted is a sz \ n rather than a sz |„, so as 
to match the Pll-opt data. There is good agreement be- 
tween the predicted amplitude, slope, and scatter of the 
^sz _ -^200 relation, with significant overlap between the 
two distributions. Because there is covariance between 
the amplitude and slope of our predicted relation, a sz \ n 
and a sz \ n are correlated despite the fact that we chose 
our pivot point to decorrelate a sz \ n and a sz \ n . 

3.3. The MCXC sub-sample of maxBCG Clusters 

One question that we have not yet addressed is that 
of the SZ signal of the MCXC subsample of maxBCG 
galaxy clusters. Pll-opt find that when they stack 
the sub-sample of maxBCG galaxy clusters that are 
also in the MCXC catalog (jPiffaretti et al.1 1201 lh - 
a heterogeneous compilation of X-ray selected cluster 
catalogs — then the observed Yjgz signal is boosted 
by a large amount, and appears to be in reason- 
able agreement with the Pll-opt predicted scaling re- 
lation. The fact that the SZ signal for the maxBCG- 
MCXC cluster subsample is higher than that of the full 
maxBCG sample has already been quan titatively ad- 
dresse d by iBiesiadzinski et all (|2012l ) and lAngulo et al.l 
(|2012D . Consequently, we do not see a need for us to 
repeat their calculations here. 

We would like, however, to present a simple qualita- 
tive argum e nt that addresses these results. As noted in 
iRozo et all (|2009aT ). the scatter in mass and Lx at fixed 
richness are very strongly correlated (r > 0.85 at 95% 
CL). This reflects that fact that the maxBCG richness 
estimator is very noisy, much noisier than Lx- Conse- 
quently, at fixed richness, a brighter cluster is always 
more massive. By the same token, a brighter X-ray clus- 
ter will also have a higher Yjgz signal. Given a richness 
bin, a selection of the X-ray brightest clusters in the bin 
necessarily "peels off" the high SZ tail of the clusters 
in the bin. Hence, the SZ signal of the X-ray bright 
maxBCG sub-sample is higher than that of the full sam- 
ple, as observed. Moreover, as one goes lower and lower 
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Fig. 2. — Top Left: Comparison of the predicted (Ygz A^20o) relations to the Pll-opt data (corrected for misccntcring and aperture- 
ind uced Malmquist bia s). The prediction assumes a 15% hydrostatic bias (at fixed aperture, 21% total bias) in X-ray masses, and reduced 
the Rozo ct al. (2009a) mass normalization by its allotted systematic uncertainty (10%). With these corrections, the V09 and M10 models 
are in good agreement with the data. Remaining panels: 68% and 95% confidence contours for the observed (purple curves) and predicted 
(solid curves) Ygz— A^ou scaling relation parameters. The Ysz^^oo prediction focuses on the V09 model only. 



in richness, a given luminosity cut will peel off systems 
that are further out in the tail of the mass distribution. 
That is, the lower the richness bin, the stronger the im- 
pact that the X-ray selection has on the recovered signal, 
in agreement with the Pll-opt data. 

Thus, we believe the apparent agreement between the 
prediction in Pll-opt and the MCXC-maxBCG sub- 
sample of galaxy clusters is entirely fortuitous. It is clear 
that the strong covariance in Lx and Y"sz at fixed richness 
predicts an increase in the SZ amplitude, and a flatten- 
ing of the slope for the X-ray bright subsample. Since 
the full sample starts below the Pll-opt prediction, the 
X-ray selection necessarily improves agreement with the 
model. This, however, is a selection effect that happens 
to move us towards the model prediction, rather than 
the result of a self-consistent model of cluster scaling re- 
lations. Indeed, the Pll-opt prediction is explicitly a 
prediction for the full maxBCG sample, not for an X- 
ray bright sub-sample. Once one accounts for selection 
effects, the model curve for the MCXC-maxBCG clus- 
ter sub-sample in Pll-opt would move upwards by the 



same amount that the Pll-opt data did, preserving the 
original tension. 

In short, the big puzzle presented by the Pll-opt 
data is not the apparent agreement between the Pll- 
opt model and the MCXC-maxBCG cluster sample - 
that is simply a fortuitous coincidence — the question is 
why did the model fail in the first place. 

3.4. The Bigger Picture 

From Figure [2j we see that the V09 model comes clos- 
est to the Pll-obs data. In the interest of brevity, we 
will henceforth focus exclusively on the V09 model. The 
results from papers I and II allow us to quickly infer 
how the M10 and Pll(z=0.23) models will behave: M10 
generically agrees with V09 at high masses, but extrap- 
olates poorly to low masses due to the constant f gas as- 
sumption, while the Pll(z=0.23) masses are biased low 
by w 20% relative to V09 across the observed range. 

Having restricted ourselves to the V09 model, we have 
shown that a 10% overestimate of the optical masses, and 
a 15% hydrostatic bias (at fixed aperture, 21% total bias) 
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results in excellent agreement with the data. To take this 
solution seriously, however, these mass calibration offsets 
need not only explain the Ysz - -^200 data, they must also 
be able to fit all multi-wavelength data available. 

For the remainder of this work, we consider whether 
these bias-corrected V09 and R09 scaling relations can 
satisfy this requirement. More specifically, for our model 
to be successful it must satisfy the following conditions: 

1. Both optical and X-ray cluster spatial abundances 
must be consistent with cosmological expectations 
in a WMAP7 cosmology. 

2. Optical and X-ray spatial abundances must be con- 
sistent with each other. 

3. The model must fit all scaling relation data, from 
both optical, X-ray, and SZ selected cluster cata- 
logs. 

4. We must be able to self-consistently use two ob- 
served scaling relations to predict a third, and the 
predictions must agree with observations. 

We now turn to address each of these points in turn. 

4. SPATIAL ABUNDANCE CONSTRAINTS 

In paper II, we considered the cosmological consis- 
tency of the V09 Lx~M scaling relation. Specifically, 
we showed that by convolving the Lx~M relation o f 
V09 with the halo mass f unction dTinker et all 120081 ) 
in a WMAP7+ BAO+ffn dKomatsu et all 120111) or a 
WMAP7+BOSS (jSanchez et al.l2012f ) cosmology, we can 
succesfull y reproduce the observ ed REFLEX luminosity 
function ([Bohringer et al.l l2002f ) . An obvious question 
that arises is whether the bias-corrected V09 scaling re- 
lation is still consistent with cosmological expectations. 
Moreover, one must also demand that the spatial abun- 
dance of X-ray galaxy clusters be consistent with the 
spatial abundance of optical galaxy clusters. That is, 
convolving n(iV 2 oo) with P(Lx\N 2 oo) one should recover 
the X-ray luminosity function. We now test both of these 
conditions. 

To estimate the X-ray luminosity function from 
maxBCG data, we randomly sample the L x-A2no re- 
lation parameters (including scatter) from IRozo et"al"1 
(|2009a[) , and use these to randomly assign an X-ray lumi- 
nosity to every cluster. We then construct the cumulative 
luminosity function n(Lx) by dividing the recovered spa- 
tial abundance by the volume sampled by the maxBCG 
catalog. The whole procedure is iterated 10 4 times, and 
we compute the average cumulative luminosity function 
along with the corresponding uncertainty, defined as the 
standard deviation of our Monte Carlo analysis. We also 
correct the luminosity function for the expected evolu- 
tion between z = 0.08, the median redshift of the RE- 
FLEX sample, and z — 0.23, the median redshift of the 
maxBCG sample, though we find this evolution is negli- 
gible for our purposes. 

Figure [3] shows the maxBCG luminosity function as 
a yellow band. Also shown are the REFLEX luminos- 
ity function (red lines) and the V09 prediction (black 
solid lines) for z = 0.08 using the bias-corrected scal- 
ing relation and the WMAP7+BOSS cosmological con- 

1 /2 

straint = 0.441 ± 0.013 (Sanchez, private corn- 
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Fig. 3. — Comparison of the X-ray luminosity function predicted 
from the maxBCG dat a (yellow band) to the REFLEX luminos- 
ity function (red band) (Bohringer et al. 2002), and the prediction 
obtained by convolving the Tinker et al. (2008) mass function and 
the bias-corrected V09 Ly^-M scaling relation (black band). That 
is, we assume hydrostatic X-ray masses suffer from a 15% bias 
at fixed aperture (21% total bias). The cosmological model for 
the V09 predicti on is a flat ACDM cosmology as constrained by 
WMAP7+BOSS JSanchez et al.120121) . We emphasize that the yel- 
low band from maxBCG data is completely independent of mass 
calibration uncertainties, and relies only on the maxBCG abun- 
dance and the LX-N200 scaling relation. 



munication, based on lSanchez et al.ll2012l) . All three lu- 
minosity functions are in excellent agreement, with all 
relative offsets being significant at less than 0.3o\ In 
principle, we should sample all cosmological parameters, 
but because the f ull covariance matrix of the cosmolog- 
ical parameters in I Sanchez et al.l (|2012t ) is not yet pub- 
licly available, we limited ourselves to sampling only the 
1/2 

combination a^ilm ■ We note, ho wever, that when usin g 
the WMAP7+BAO+ J ff chains of lKomatsu et al.l (|20 111) , 
we find that varying all cosmological parameters or only 

<j%Vlm yields ver y similar results, as ex pected (see e.g. 
the discussions in I Weinberg et al.ll2012|) . 

We can also directly test whether the maxBCG func- 
tion and the M-N200 relation from R09 are consistent 
with cosmological expectations. We randomly draw the 
parameters of the M-N200 relations from Table [2l and 
use the resulting P(M|iV2oo) distribution to randomly 
assign masses to each maxBCG galaxy cluster. Know- 
ing the volume sampled by the maxBCG systems (z £ 
[0.1,0.3], 7398 deg 2 ), we compute the corresponding cu- 
mulative mass function , and compare it to predictions 
from the iTinker et all (|2008f) mass func tion, sampled 
over the cosmological constraints from iSanchez et al.1 

(2012). As above, we sample only as&Jj 2 , holding the 
remaining cosmological parameters fixed. The whole pro- 
cedure is iterated 10 3 times, and the mean and variance 
of the difference between the two mass functions is stored 
along a grid of masses. The 68% confidence band is es- 
timated using the ±cr region where a is the measured 
standard deviation. 

Figure Q] shows the difference between the cluster and 
halo mass functions, in units of the Poisson error. The 
statistical error in the spatial abundance is super- Poisson 
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Fig. 4. — Comparison of the mass function predicted from 
the maxBCG data to the ITinker et all l l200gf) mass function 
for a flat ACDM co smology as constrained by WMAP+BOSS 
USanchez et al.l [20121) . Lowering the maxBCG masses by 10% is 
consistent with cosmological expectations. A 20% shift, how- 
ever, starts to be in mild (2cr) tension with the data. Thus, we 
take this as roughly the maximum correction compatible with the 
Sanchez ct al. (2012) results. 



(|Hu fc Kravtsovl 120031) . but the uncertainty in the dif- 
ference is actually dominated by uncert ainties in cluster 
mass es rather than cluster statistics (e.glWeinber g etaD 
I2012D . The yellow band in Figure H shows the 68% con- 
fidence contours for our fiducial case. The fact that this 
band covers the lin e y = demonstrates that the fiducial 
iRozo et all (|2009a( ) M-iV 20 o relation and the maxBCG 
spatial abundance function are consistent with cosmolog- 
ical expectations. The up-turn at low masses marks the 
scale at which the maxBCG catalog becomes incomplete. 

The red dashed lines in Figure H] shows the 68% confi- 
dence contours for the difference between the cluster and 
halo mass function s after reducing the amplitude of the 
IRozo et all (|2009a| ) M-N 200 relation by 10%. The differ- 
ence between the resulting maxBCG data and the theo- 
retical mass functions is « lAa, so this shift in cluster 
masses is statistically allowed. Lowering the amplitude 
by 20% — shown as the green dashed-dotted curves — 
results in a marginally unacceptable fit, with 2.2cr ten- 
sion. Thus, a 20% bias on the weak lensing mass scale 
from R09 is roughly the maximum level of bias allowed 
from a cosmological perspective. 

By the same token, one may ask what level of hydro- 
static bias does the X-ray data allow, in the sense that 
the predicted and observed X-ray luminosity functions 
must be consistent with each other. We find that up to a 
« 30% hydrostatic bias at fixed aperture (ps 40% total) 
is acceptable (< 2a) when using WMAP7+BOSS priors. 
These results are entirely consistent with the biases re- 
quired to explain the Ysz-A^oo relation, i.e. 10% bias 
in the maxBCG masses, and a 15% hydrostatic bias in 
X-ray masses. Interestingly, this also demonstrates that 
cosmological considerations can place an upper limit for 
the hydrostatic and weak lensing biases in the data. In 
conjunction with the lower limit placed by the lsz-A 2 oo 
relation, these limits nicely bracket our proposed solu- 
tion. 



5. OPTICAL AND X-RAY CONSISTENCY 

V09 calibrated the Lx~M scaling relation, as well as 
the M-Yx sc aling relations. Usi ng the Ygz-Yx scaling 
relation from IRozo et al.l (|2012aD . in paper II we used 
this data to also produce a V09 Ysz~M scaling relation. 
We now derive the Ly^-M and Y§z,-M relations from 
maxBCG data, and compare them to the V09 results. 

5.1. Method 

We rely on a Monte Carlo method to compute the scal- 
ing relations predicted from maxBCG. Specifically, we 
use the M-N 2 oo, Lx-N 20 o, Ysz~N 20 o scaling relations to 
randomly assign masses, X-ray luminosities, and SZ sig- 
nals to each maxBCG galaxy cluster. We then select a 
mass-limited cluster sub-sample with M > 2 x 10 14 M & 
— comfortably above the completeness limit of the sam- 
ple, see Figure |4] — and we fit for the resulting scaling 
relation parameters. The whole procedure is iterated 10 
times. Each scaling relation is then evaluated along a 
grid in mass, and the mean and standard deviation at 
each point computed. The 68% confidence intervals are 
estimated as the ±tr regions where a is the standard de- 
viation. 

There is one additional key consideration when pre- 
dicting the scaling relations from the maxBCG data: the 
cluster variables Lx, ^sz, and M, are all tightly corre 
lated with each other at fixed richness (R09 ; |Stanek et al 



2010l:IWhite et al.ll2010l : lAngulo et al.ll2"oTllNoh fc Cohn 



20121 ) . This reflects the fact that A^oo !S a very poor mass 
tracer with very large scatter: a cluster that is brighter 
in X-rays will also be more massive and have a higher SZ 
signal. We set the correlation coefficient between the var- 
ious observables using the local power-law multi-variate 
scaling relations model detailed in Appendix A of paper 
II. Specifically, the covariance between a quantity ^ and 
m at fixed richness (subscript n) is given by 



[l - r 2 + (cr m |„/(T m | V , - r)' 



I 1/2 



(9) 



where r is the correlation coefficient between x and n 
at fixed m. We take the simplifying assumption that 
this intrinsic covariance is zero (r = 0), which implies 
r x,m\n = 0-90 and r sz m \ n = 0.98. Since the mass scatter 
at fixed optical richness {o~ m \ n ) is significantly larger its 
counterpart at fixed X-ray luminosity (a m \ x ), the precise 
value of r has only a mild impact; varying r £ [—0.5, 0.5] 
varies r x m \ n over the range r x>m \ n £ [0.86,0.93], with 
even a smaller range of variation for r sz m \ n . 

Equation ([9]) specifics the correlation coefficient we em- 
ploy when assigning cluster properties to the maxBCG 
galaxy clusters. When evaluating the correlation coeffi- 
cient, we hold the scatter cr m \ x fixed to its central value. 
This is because the main source of scatter in T x ^ rn ^ n is the 
uncertainty in a m \ n . In addition, it avoids introducing 
covariance between fluctuations in the scaling relation 
predicted from maxBCG data and the V09 predictions. 
Because tx m i„ is drawn for each Monte Carlo realization, 



and 



sz,m\n 



each realization has an independent r x 
estimate. 

To compute the 68% confidence regions for the V09 
scaling relations, we rely instead on the method used in 
paper II: the scaling relations parameters are randomly 
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sampled 10 5 times from the priors in Tables [2] and [3J 
These are used to compute the scaling relation parame- 
ters using Appendix A in paper II, which we in turn us 
to evaluate the mean and standard deviation of the scal- 
ing relations along a grid of masses. The 68% confidence 
intervals are estimated as above. In all cases, the biased- 
corrected scaling relations are computed by modifying 
the amplitude of the bias-free scaling relations involving 
mass by the appropriate amounts (10% for R09, 21% for 
V09). We also rescale cluster observables defined within 
i?50o as in papers I and II to account for the change in 
aperture due to mass rescaling, though we note these cor- 
rections are typically small relative to the impact of the 
mass offset. 

5.2. Results 

The top-left panel in Figure [5] shows the 68% confi- 
dence contours for the V09 and maxBCG Lx~M scaling 
relation. Both sets of relations are bias-corrected, and 
are in excellent agreement with each other. The scat- 
ter, too, is in good agreement, with <7\ n L x \M = 
for the maxBCG prediction, and ci n L x jM = 0.40 ± 0.04 
for the V09 scaling relation. This agreement is better 
illustrated in the remaining panels, where we explicitly 
compare the predicted amplitude, slope, and scatter of 
the Lx~M relation from the two works. In all cases, the 
filled contours correspond to the maxBCG bias-corrected 
scaling relations, while the solid curves demarcate the 
bias-corrected V09 scaling relation. 

Figure |6] mirrors Figure [5j only there we are consid- 
ering the Ysz-M scaling relations. The top-left panel 
shows the 68% confidence interval in the Ygz~M plane, 
while the remaining panels show the 68% and 95% con- 
fidence contours for the scaling relation parameters. In 
all cases, we consider only the bias-corrected scaling re- 
lations. Even though the agreement is less good than in 
the case of the Lx~M relation, the agreement between 
the two data sets is still adequate. 

Despite the two data sets being in statistical agree- 
ment, it is worth asking what would be required to im- 
prove upon the current status, particularly in the case 
of Ysz-M, where the agreement is less good. Because 
the maxBCG band falls below the V09 data in the Ysz - 
M plane, but above the V09 data in the Lx~M plane, 
shifting the cluster masses will increase tension in one 
relation while alleviating the other, and is therefore not 
a good avenue for improving things. In addition, the X- 
ray luminosities are tied down by the spatial abundance 
constraints from the previous section. Consequently, any 
improvement to the current scenario would have to come 
from « 10% — 20% changes in the Yjgz measurements: ei- 
ther the V09 Ysz measurements would have to go down 
(e.g. due to Malmquist bias due to SZ cluster selection), 
or the Pll-opt measurements would have to go up. Since 
the V09 Ysz/Yx ratio is already on the low end of what 
is expected, an increase of the Isz signal of maxBCG 
clusters is the most promising avenue. 

If we repeat this analysis using the original maxBCG 
and V09 scaling relations, we find that while the two data 
sets are in modest agreement with regards to the Lx~M 
relation, the Ygz~M relation of the two works are clearly 
discrepant. This disagreement reflects the original ten- 
sion in the Ysz-^oo relation, so it is not surprising that 
fixing the Ysz-A^oo relation also results in consistent es- 



timates for the Ygz~M relation. That said, it is far from 
trivial that this same solution is consistent with the Lx~ 
M relation, and with the spatial abundance constraints 
discussed in the previous section. 

6. CLOSURE TEST: THE Y sz -L x RELATION 

We have seen that the bias-corrected Lx~M and Ysz - 
M maxBCG and V09 scaling relations are in good agree- 
ment with each other, and are also consistent with cos- 
mological expectations. We now test whether the direct 
observable scaling relations are self-consistent. Specifi- 
cally, we test whether the Ygz—Lx relation predicted from 
the lsz _ ^200 and ix _ A r 2oo relations is consistent with 
measurements from X-ray cluster catalogs. Further, we 
will also compare the predicted Ysz~Lx scaling relations 
to the bias-corrected prediction from the V09 model. 

The maxBCG predictions are computed as before: we 
randomly assigned Lx and Y"sz values to clusters in the 
catalog, including the effect of the correlation coefficient. 
The correlation coefficient between Isz and Lx at fixed 
richness is estimated following an argument similar to 
the previous section. Specifically, we assume r n x \ sz = 0, 
and estimate r XfSZ \ n using the formulae in Appendix A of 
paper II. Typical values for r x<sz \ n lie in the range [0.8, 1], 
as expected. We then cut the resulting cluster catalog at 
Lx > 8 x 10 43 ergs/z, which is sufficiently high to not 
be affected by completeness issues, and fit the data to 
arrive at our model amplitude, slope, and scatter about 
the mean. The V09 predictions are performed exactly as 
in paper II, where we also considered the Ygz-Lx rela- 
tion for the V09, M10, and Pll data sets, only we now 
consider the bias-corrected V09 scaling relations only. 

We will be comparing the maxBCG and V09 pre- 
dictions for the Ysz-Lx scaling relation to those de- 
rived from the Pll(z=0.23) data, and that obtained 
by stacking X- ray galaxy clusters from the MCXC 
cluster catalog dPiffaretti et al.l 120 111) as described in 
iPlanck Collaboration! (1201 laf ). hencerforth referred to as 
Pll-X. For the Pll(z=0.23) data, we fit the clusters in 
Pll in the rcdshift range z £ [0.13,0.3], and modify the 
recovered amplitude as described in Pll. We do not at- 
tempt to measure the scatter because of selection effects. 
Rather, our fits assume an intrinsic scatter in Ygz—Lx of 
0.4, which is consistent with all data. 

As for the Pll-X fits to the Ysz~Lx scaling relation, 
we perform three independent corrections: 1) Malmquist 
bias corrections due two covariance between Y"sz and Lx- 
2) Aperture induced corrections due to the difference in 
mass calibration between Pll and the bias-corrected V09 
model. 3) A correction to account for the fact that Pll-X 
measure (Yszl-^x) rather than (\nYsz\Lx) ■ 

The first of these corrections — the Malmquist bias due 
to correlated scatter — works exactly as per our discus- 
sion of the lsz _ -^200 scaling relation, with an additional 
important consideration: in addition to aperture-induced 
covariance, Isz and Lx are expected to be correlated be- 
cause they probe the same intra-cl uster medium . The 
corr elation coeffic i ents p redicted in iStanek et al.1 ()2010l ) 
and lAngulo et al.1 (|2012f ) are r ~ 0.8 and r ps 0.5 respec- 
tively. We set r sz x \ m = 0.65±0.2, where the uncertainty 
in r is propagated into the uncertainty of the amplitude 
of the Ysz-Lx relation. We also use the scatter estimates 
for <J sz \ m and cr x i m from the V09 model, and propagate 
the associated uncertainty. The net effect of this correc- 
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TABLE 3 
Input Scaling Relations at z 



0.23 



Relation 


Xo 




a 


°lnVlx 


Sample 


L x -M 


4.8 


1.16 ±0.09 


1.61 ± 0.14 


0.40 ± 0.04 


V09 


D 2 A Y SZ -N 200 


40.0 


-0.20 ±0.13 


1.95 ± 0.07 


0.70 ±0.15 


maxBCG 


LX-N200 


40.0 


0.04 ± 0.04 (ran) 


1.63 ± 0.06 (ran) 


0.83 ± 0.03 (ran) 


maxBCG 






± 0.09 (sys) 


±0.05 (sys) 


± 0.10 (sys) 





a Conventions as per Table[2] X-ray luminosity is measured in the [0.1, 2.4] keV band, in units 
of 10 44 ergs/s. 




Bias Corrected 




M 500 c (Me) 



0.0 0.2 0.4 
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Fig. 5. — Top-left panel: 68% confidence interval for the Ly^-M scaling relation at z = 0.23 as estimated from the maxBCG catalog 
and V09 data sets. Both sets of scaling relations are bias-corrected, assuming a 10% overestimate of the maxBCG masses, and a 15% 
hydrostatic bias (at fixed aperture, 21% total bias) on the X-ray masses. Remaining panels: 68% and 95% confidence regions of the Ly^—M 
parameter for the bias-corrected maxBCG (filled contours) and V09 (solid black curves) data sets. The amplitude parameter is defined at 
the pivot point M = 3 X 10 14 Mq, which is appropriate for the bias-corrected maxBCG data set. 
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2.4 




-1.0 -0.5 0.0 0.5 1.2 1.4 1.6 1.8 2.0 2.2 2.4 

Amplitude Slope 

Fig. 6. — Top-left panel: 68% confidence interval for the Ygz-M scaling relation at z = 0.23 as estimated from the maxBCG catalog 
and V09 data sets. Both sets of scaling relations are bias-corrected, assuming a 10% overestimate of the maxBCG masses, and a 15% 
hydrostatic bias (at fixed aperture, 21% total bias) on the X-ray masses. Remaining panels: 68% and 95% confidence regions of the Ygg-M 
parameter for the bias-corrected maxBCG (filled contours) and V09 (solid black curves) data sets. The amplitude parameter is defined at 
the pivot point M = 3 X 10 14 Mq, which is appropriate for the bias-corrected maxBCG data set. 
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Fig. 7. — Top-left panel: 68% confidence interval for the Ysz~M scaling relation at z = 0.23 as estimated from the maxBCG catalog 
(yellow band) and V09 (black lines) data sets. Both sets of scaling relations are bias-corrected, assuming a 10% overestimate of the maxBCG 
masses, and a 15% hydrosta tic bias (at fixed aper t ure, 2 1% total bias) on the X-ray masses. Also shown as a purple b and is the 68% 
confidence interval quoted in Planck Collaboration (2011a) from stacking X-ray selected clusters from the MCXC catalog (Piffarctti ct al. 
120111) . We have corrected these fits for Malmquist bias due to correlated scatter, and aperture corrections due to the mass bias relative 
to V09 (see text). The green line is our own fit to the Pll(z=0.23) data, modified for selection effects as described in Pll. The data 
points with error bars are clusters with z £ [0.13,0.3] in Pll. Blue points are cool-core systems, while red points are not. The green line 
is the best fit relation to the Pll(z=0.23) data. Remaining panels: 68% and 95% confidence regions of the lgz-M parameter for the 
bias-corrected maxBCG (yellow contours) and V09 (solid black curves) data sets. The amplitude parameter is defined at a pivot point 
Lx = 2 X 10 44 ergs/s, which is a compromise between the V09 and maxBCG predictions. The green contours corresponds to our fit of the 
Pll data, while the purple contours are those obtained from the MCXC catalog. 



tion is to lower the Y"sz amplitude by « 7% ± 2.5%. 

The aperture correction to the Pll-X data has to do 
with the dependence of Yqz on the integration aperture 
i?50o- From paper 1, we know that a bias b m in the mass 
induces an amplitude shift Aa = 0.27 In b m on Ysz- The 
Pll(z=0.23) masses are biased by AlnM 0.2 relative 
to V09, which itself suffers from a 15% hydrostatic bias 
in our bias-corrected model. All told, this increases the 
Y sz amplitude by « 10% ± 3%. 

Finally, the cluster stacks in Pll-X measure (Ysz\Lx) 
rather than (lnYsz|£x), so their reported amplitude 
must be lowered accordingly, by re 8% ± 2%. Put to- 
gether, these three corrections result in an amplitude 
shift of Aa sz \ x = -0.050 ± 0.044. We apply this system- 
atic correction to the Pll-X data, and add the associated 
uncertainty in quadrature to their quoted errors. 



The top-left panel in Figure [7] compares the predicted 
Ysz-Lx scaling relation derived from the V09 data set 
(black solid lines) with that of the maxBCG catalog (yel- 
low band). Note for both of these, we only consider the 
bias-corrected scaling relations (mass enters via the aper- 
ture corrections to Ysz)- The purple band is the observed 
relation in the MCXC cluster catalog from Pll-X. The 
best fit line to the Pll(z=0.23) data set is shown as a 
solid green line. The points with error bars are individual 
clusters in the Pll(z=0.23) data set, blue for cool-core 
systems, and red for non cool-core systems. 

The agreement between the various data sets is best 
judged using the remaining panels of Figure [3 where we 
show the corresponding 68% and 95% confidence con- 
tours for the scaling relation parameters. Focusing on 
the top-right plot, which shows the amplitude and slope 
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parameters, we see that the Pll(z=0.23) data is in some 
tension with the other data sets: the amplitude is too 
high. The simplest explanation for this offset would be 
an underestimate of the SZ selection effects in Pll. 

There is, however, reasonable agreement between the 
maxBCG, V09, and Pll-X data sets: the la Pll-X data 
curve in the top-right panel overlaps the la contours for 
both the maxBCG and V09 data sets. All three data 
sets are in excellent agreement on the intrinsic scatter in 
the Ygz—Lyi relation. To achieve better agreement, the 
Ygz signal of the maxBCG galaxy clusters would need 
to increase by » 10% — 20%, especially at low masses; 
a decrease of the V09 Y"sz signal would be difficult to 
reconcile with X-ray expectations for the Ysz/Yx ratio, 
while shifts of the X-ray luminosity are limited by spatial 
abundance constraints. 

7. THE MASS SCALE OF GALAXY CLUSTERS 
7.1. The Mass Scale of maxBCG Galaxy Clusters 

We have already argued that the origin of the bias in 
X-ray masses proposed in this paper is simple hydro- 
static bias. One interesting question that remains open, 
however, is where did the bias in the stacked weak lens- 
ing mass calibration in R09 come from? At least part 
of this answer has to do with the impact of covariancc 
between mass and weak lensing mass at fixed richness. 
Covariance between M w i and -/V200 introd uces corrections 
in ma ss calibration that were ignored in iJohnston et al.l 
(|2007| ) and R09. Specifically, from Appendix A in paper 
II we have 

(\nM w i\N 200 ) 

^wl\m ~t~ ^wl\m 

(In M \N 200 ) 

where we have assumed 

(In M w i\M) = a wl \ m + a wl \ m \nM. (11) 

Assuming the weak lensing mass estimates are unbiased 
in the sense that (hiM w i\M) = InM, we can set a w \\ m = 
and ct w \\ m = 10 Inserting this into our expression for 
(In M w i \N200 }, and solving for (lnM|A?2oo) we find 

(]nM\N 2 oo) = {\jxM w i\N2oa)-r w i tn \ m Pa m \ w ia m \ n (12) 

We see then that any covariance between M w i and A200 
at fixed mass implies that the weak lensing masses must 
be corrected downwards. This is exactly the same type of 
Malmquist-bias correction that we applied on the Ysz - 
7V 2 oo an d Ygz~Lx scaling relation s, though this covari- 
ance is not aperture- induced. In lAngulo et al.l (|2012D . 
the correlation coefficient r w [ n \ m is estima ted to be r ~ 
0.4, in good agreement with the valu e fromlNoh fc Cohn 
( 2012t ). Assuming a w i\ m « 0.2 as per lBecker fc Kravtsov 
( 20111), a n \m ~ 0.4, and f3 w 3, and setting r to the 
lAngulo et al.l (|2012h value, we find that the mass calibra- 
tion from R09 should be corrected downwards by 10%. 
This correction is in excellent agreement with the conclu- 
sions from our study. As in the case of the ISZ-A200 rela- 
tion, the covariance between M w \ a nd iV^oo may be some- 
what overestimated. For instance, lAngulo et al.l (|2012f ) 

In practice, unbiased weak lensing masses would result in 
(M wl \M) = M rather than (\nM wl \M) = InM. The difference 
between these two assumptions is a net offset of 0.5<T^ ; | m ~ 2%, 
which is negligible for our purposes. 



measured M w i within (i?5oo|A r 2oo}> whereas M w \ in R09 
comes from directly fit ting a halo model t o the shear 
data. Consequently, the lAngulo et al.l (|2012l ) value likely 
needs to be reduced for the impact of aperture-induced 
covariance P^l However, orientation effects due to optical 
cluster selection does appear to induce some covariance 
that can lead to ~ 6% overestimates of cluster masses 
if this affects is unaccoun ted for (Jorg Dietric h, private 
communication) , and the iNoh fc Cohnl ([2012D value for 
the correlation coefficient is not affected by aperture- 
induced covariance. Overall, it seems clear that a « 10% 
downwards correction to the R09 masses due to covari- 
ance between M w i and -/V200 is a plausible explanation 
for the shift in the mass-scale of maxBCG galaxy clus- 
ters needed by our self-consistent model of multi-variate 
cluster scaling relations. We note, however, that such 
a shift is also within the systematic error allotted due 
to cluster miscentering and source photometric rcdshift 
errors. 

In this context, it is also worth emphasizing that 
while much of the work addressin g the PI 1-opt data 
has focused on the IJohnston et al.l (120071 ) scaling rela- 
tio n, this is not well justified. As was first pointed out 
by IMandelbaum et all (|2008bD . one expects significant 
phot ometric redshift correc tions relative to the raw data 
from IJohnston et al.l (|2007l) . These correction arise be- 
cause the lens and source populations in the SDSS arc 
overlapping, which tends to dilute the weak lensing sig- 
nal. Moreover, the inverse critical surface density varies 
quickly with source redshift when z source ss zi ens , making 
weak lensing masses more sensitive to photometric errors. 
R09 a pplied these co rrections, and combined the cor- 
rected ISheidoEiEI^ (|2007f ) 
analysis with that of lMandelbaum et al.l ()2008al ) to place 
priors on the M-N200 relation of maxBCG galaxy clus- 
ters, including this effect. These corrections should be 
applicdQ Moreover, there are now several independent 
stacked weak le nsing mass calibrations of the maxBCG 
galaxy clusters (IMandelbaum et"aTI l2008at iSimet et al.l 
l2012t iBauer et afll2012l ). all of which are consistent with 
the R09 scaling relation. Importantly, all these stud- 
ies are also subject to the covariance-induced Malmquist 
bias noted above. 

7.2. Preferred Set of Scaling Relations 

We have demonstrated that the bias-corrected R09 and 
V09 cluster scaling relations are fully self-consistent; not 

12 It is worth emphasizing that whether the correlation co- 
efficient needs to be reduced by this effec t or no t has nothing 
to do with problems in the lAngulo et al] 1)20121 ) analysis, but 
rather detailed differe nces in how the mass is estimated betwee n 
IJohnston et al.1 ( 120071) and the simulations of lAngulo et al.1 (120121 ') : 
the weak lensing mass measurements are just not identical in detail. 
In other cases, such as for the Ygz measurement that we discussed 
earlier, such aperture-induced c ovariance need s to be explicitly in- 
cluded as per the simulations of lAngulo et al. (2012). 

13 In fact, there is additional evidence that the original 
IJohnston et al.1 l|2007l) mass es suffer from photometric rcdshift 
biases. IRozo et al.l j|2009bl) demonstrated that there is large, 
systematic-driven evolution in the richness— mass relation of the 
maxBCG galaxy clusters, seen both in the X-ra ys and velocity dis- 
persions HBecker et al.ll2007l; IRvkoff et al.ll20TM) . This evolution is 
not seen in the weak l ensing data. Systematic re dshift errors like 
those pointed out by Mandclbaurn et al. (2008a) have the right 
magnitude and sense to explain the lack of evolution in the weak 
lensing data. 
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TABLE 4 

Preferred Set of Scaling Relations 



Relation xo Amplitude (a^,| x ) a i>\x "Im/Hx Sample 

L x -M 4.4 0.72 ±0.07 (ran) ±0.16 (sys) 1.55 ±0.09 0.39 ± 0.03 V09±maxBCG 

D\Y SZ -M 4.4 0.87 ±0.06 (ran) ±0.17 (sys) 1.71 ±0.08 0.15 ± 0.02 V09±maxBCG 



M-N 20 o 40 0.75 ±0.10 1.06 ±0.11 0.45 ±0.10 maxBCG 

Lx-N 20 o 40 0.04 ±0.10 1.63 ±0.08 0.83 ±0.10 maxBCG 

Y sz -N 2 oo 40 -0.24 ±0.20 1.97 ±0.10 0.70 ±0.15 maxBCG 



Ysz-Lx 1.0 -0.29 ±0.06 1.10 ±0.03 0.40 ± 0.05 Pll-X 



a X-ray luminosity is measured in the [0.1,2.4] keV band in units of 10 44 ergs/s. -D^lsz is m 
units of 10 -5 Mpc 2 . The maxBCG scaling relations are bias-corrected, while the V09±maxBCG 
relations are the joint constraint from the bias-corrected V09 and maxBCG samples. Scaling 
relations involving mass include a ±10% systematic uncertainty in the mass. The error in the 
amplitude of the Ysz~^x relation is larger than that quoted in Pll-X because we include the 
uncertainty in our systematic corrections. This set of scaling relations is fully self-consistent. 



only are the observable-mass relations derived from op- 
tical and X-ray selected cluster catalogs consistent with 
one another, the optical and X-ray spatial abundance 
functions are consistent with each other and with cos- 
mological expectations. Because the Lx~M and Ysz~ 
M scaling relations predicted from the bias-corrected 
maxBCG and V09 scaling relations are in good agree- 
ment with each other, we combine them to arrive at our 
preferred estimates for these relations. These are sum- 
marized in Table |4l along with our preferred set of scaling 
relations for M-N 20 o, L x ~N 20 o, Y sz -N 2 oo, and Y SZ -L X - 
For this last set of scaling relations we do not attempt 
to combine the maxBCG and V09 bias corrected results: 
rather, we rely on the measurement that most directly 
probes each scaling relation. We emphasize, however, 
that this set of scaling relations is still fully self-consistent 
with the quoted uncertainties. The pivot point for Lx~ 
M and Ysz~M is set to 4.4 x 10 14 M has been chosen 
so as to decorrelate the amplitude and slope of the joint 
scaling relations. We expect a conservative estimate of 
the systematic uncertainty due to the hydrostatic bias 
and weak lensing bias corrections is 10% in mass, which 
we expect is best modeled as a top-hat distribution rather 
than a Gaussian. This error corresponds to a ±0.16 error 
in the amplitude for Lx~M, and a ±0.17 uncertainty in 
the amplitude for Ysz~M . We now employ our preferred 
set of scaling relations to predict the cluster masses for 
each of the CLASH systems, and to compare the masses 
recovered from these scaling relations to those from var- 
ious works in the literature. 

7.3. CLASH Predictions 

Our final set of scaling relations in Table [4] fits an 
extra-ordinary amount of data, and satisfies a large vari- 
ety of highly non-trivial internal consistency constraints. 
As such, we believe it can provide a critical low-rcdshift 
foundation for the study of cluster scaling relations and 
their evolution. In additio n, these relations pr ovide a 
clear target for the CLASH (Po stman et al.ll2012f) exper- 
iment, which seeks to provide very high precision masses 
for a small subset of X-ray and lensing selected galaxy 
clusters. In Table we have provided predictions for 
the cluster masses for each of the z < 0.4 CLASH galaxy 
clusters based on their X-ray luminosity, as quoted in 
M10. We have limited ourselves to z < 0.4 clusters to 
minimize the impact of rcdshift evolution in our scal- 



ing relations, for which we assume self-similar evolution, 
Lx oc E 2 (z), as appropriate for soft X-ray band cluster 
luminosities. For the few CLASH systems not in M10, 
Mantz shared with us X-ray luminosities derived using 
the same data analysis pipeline as in M10 (Mantz, pri- 
vate communication). Note that to make these predic- 
tions, we utilize the distribution P(M\Lx) rather than 
P(Lx\M). The two are related as described in Appendix 
A of paper II, and we have assumed a slope of the halo 
mass function of /3 = 4, as appropriate for ss 10 15 Mq 
galaxy clusters. 

Unfortunately, the large scatter in the Lx~M relation 
implies that our predicted masses are highly uncertain, 
as there is an irreducible ±25% intrinsic scatter. Pre- 
dictions using the SZ signal arc significantly more pre- 
cise, but there is only one z < 0.4 CLASH system with 
SZ measurements in the Pll sample, Abell 2261, which 
we discuss more fully below. Y"sz estimates for these 
galaxy clusters is very desirable for the purposes of tight- 
ening our theoretical predictions. We also note that all 
of our mass predictions have an overall systematic floor 
of ±10%. As emphasized in paper II, it is worth re- 
membering that the mass estimates from scaling rela- 
tions are only as good as the input data used to estimate 
the clust er masses. For instan ce, in the MCXC cluster 
catalog dPiffaretti et al.ll201lD . clusters A 383, A 209, 
A 1423, A 611, and MACS J1532, all have luminosities 
that differ from the ones quoted above by 30% or more. 
Should these luminosities be correct, then our mass pre- 
dictions would necessarily be biased by the corresponding 
amount. 

At this time, there are only two clusters with published 
masses from a full-lensing analysis of CLASH data: Abell 
2261 and MACS J1205-08. Our prediction for Abell 2261 
is M 50 oc = 1-07 ± 0.28 (stat) x 10 15 M based on its X- 
ray luminosity, and M 5Q0c = 1-16 ± 0.14 x 10 15 M Q 
based on its SZ signal (both have an additional ±10% 
top- hat systematic uncertainty). For the latter, we again 
assume self-similar evolution, with Ygz cx E(z) 2 ^ 3 , and 
we have applied the expected aperture correction due 
to the shift i n mass cal i bratio n between our work and 
that of Pll. iCoe et ail (|2012| ) quote virial masses and 
concentrations rather than Af5oo c , so we use their best 
fit model to convert their numbers to M^oq c . They find 
M 500c = 1.34 ± 0.13 x 10 15 M Q , in excellent agreement 
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TABLE 5 

A%ooc Predicted Masses for z < 0.4 CLASH Clusters 



Cluster 


z 


L x (10 44 ergs/s) 


Af 8 00c ( 


10 15 M Q ) 


A 383 


0.187 


5.9 ± 0.2 


0.69 


± 


0.18 


A 209 


0.206 


8.6 ± 0.3 


0.87 


± 


0.23 


A 1423 


0.213 


6.2 ± 0.4 


0.70 


± 


0.19 


A 2261 


0.224 


12.0 ± 0.4 


1.07 


± 


0.28 


RX J2129 


0.234 


9.9 ± 0.5 


0.94 


± 


0.25 


A 611 


0.288 


7.5 ± 0.4 


0.75 


± 


0.20 


MS 2137 


0.313 


12.3 ± 0.4 


1.02 


± 


0.27 


MACS J1532 


0.345 


19.8 ± 0.7 


1.36 


± 


0.36 


RX J2248 


0.348 


30.8 ± 1.6 


1.80 


± 


0.49 


MACS J1931 


0.352 


19.7 ± 1.0 


1.36 


± 


0.36 


MACS J1115 


0.352 


14.5 ± 0.5 


1.10 


± 


0.29 


MACS J1720 


0.391 


10.2 ± 0.4 


0.86 


± 


0.23 


MACS J0429 


0.399 


10.9 ± 0.6 


0.89 


± 


0.24 



a X-ray luminosities are all from M10, or from Mantz (private 
communication). The quoted mass uncertainty for our predicted 
masses (4 th column, based on our preferred Lx-M relation) is 
statistical only, including the intrinsic scatter in the M-Lx scal- 
ing relation, and uncertainty in the X-ray luminosity estimates. 
There is an additional overall ±10% systematic uncertainty in 
the cluster masses that scales all clusters uniformly. 

with either of our predictions. By comparison, the mass 
of A2261 in Pll — which Pll obtains based on their Yx 
measure ments — i s M^ c = 0.64 ± 0.04, is in tension 
with the iCoe et al.l (|2012t ) value at 5<r. 

Turning to MACS J1205-08, our predicted masses from 
L x and Y sz are M 50 0c = 1-26 ± 0.34 x 10 15 M Q and 
M 500c = 1.36 ± 0.20 x 10 15 Mo respectively. For the 
Lx-based mass, we relied on the X-ray lum i nosity from 
Pll. The reported mass in lUmetsu et al.l (|2012l ) cor- 
responds to Af 50 oc = 1.01 ± 0.15 x 10 15 M Q , in excel- 
lent agreement with both our X-ray and SZ derived clus- 
ter mass estimates. The corresponding mass in Pll is 
M^ooc = 1.08 ± 0.24, also in good agreement with the 
CLASH measurement. We emphasize that the predicted 
uncertainties in our SZ masses are 12% and 15% respec- 
tively, so the agreement between our predicted masses 
and those from the CLASH c ollaboration is non-tr ivial. 
It should also be noted that lUmetsu et al.l (|2012t) find 
good agreement between their lensing mass estimates 
and direct hydrostatic mass estimates from a joint analy- 
sis of Chandra and X MM data and an inde pendent anal- 
ysis of SZ data as per iMroczkowskil (|2011| ). 

7.4. Predicted Masses for V09, M10, and Pll 

Just as we predicted the cluster masses for the CLASH 
cluster sample, we can use our best fit Lx~M relation 
to derive cluster masses for each of the galaxy clusters 
in V09, M10, and Pll. Our results are summarized in 
Figure[H where we plot In (M\Lx) —In Mm versus cluster 
redshift for each of the three cluster samples: V09 (black 
points), M10 (red points), and Pll (blue points). To 
evolve the scaling relation away from z = 0.23, we assume 
a self-similar evolution Lx oc E 2 (z). In deriving these 
masses, we use the X-ray luminosity quoted in each work. 

As expected, our recovered masses are higher than 
those from V09: 11% ± 4% for the low redshift sample, 
and 22% ± 5% for the high redshift cluster sample. Note 
that our putative bias-corrected V09 scaling relation as- 
sumed a 15% hydrostatic bias, which would naturally 
result in a w 20% offset. However, upon combining this 



scaling relation with maxBCG one, the mass scale de- 
creases slightly, leading to the smaller low redshift bias 
estimated above. Turning to M10, we find that our pre- 
dicted masses are in excellent agreement with theirs: the 
mass offset is A In M = 2% ± 3%, and there is no evi- 
dence of evolution in the mass offset. M10 notes that a 
Chandra calibration update that appeared as the work 
was being published lowers their masses relative to the 
published values by w 11%, which would result in a net 
offset A In M = 13% ± 3%. 

Turning to the Pll data set, for the low redshift sam- 
ple (z < 0.13) we see a clear redshift trend in the 
data, with the mass offset increasing with increasing red- 
shift. Whether this trend will remain as deeper data 
comes out remains to be seen. If we ignore this evolu- 
tion and simply compute the mean mass offset, we find 
A In M = 9% ± 6%. For the high redshift cluster sample 
(z > 0.13), we no longer see systematic evolution in the 
mass offset, finding A In M = 21% ±4%. We warn, how- 
ever, that because these clusters were also SZ-selected, 
the SZ selection may pick up the more massive clusters 
at fixed X-ray luminosity. Such a selection would tend 
to increase the X-ray masses from Pll, thereby reducing 
the apparent offset relative to our mass calibration. To 
test for this possibility, we also c omputed the mass o ffset 
relative to the X-ray masses in iPratt et al.l j2009), for 
which we find A In M = 28% ± 0.05%. 

The mass offset between our predicted masses and 
those of M10 and Pll are very surprising. Naively, we 
would have expected the sum of the offsets between our 
masses and those in M10 and Pll to be consistent with 
the M10-P11 offset, but this is not the case. That is, 
despite the fact that at at z « 0.3 we find that the M10 
masses are higher than those in Pll by « 45% ± 6%, 
when we look at the full M10 and Pll cluster samples, 
our masses are only « 20% higher than those in Pll, and 
consistent with M10. The difference is due to the partic- 
ular cluster sub-sample on which we based the M10-P11 
comparison: for this sub-sample, our predicted masses 
arc 18% ± 6% lower than those in M10, and 29% ± 4% 
higher than those in Pll. Evidently, exactly which clus- 
ters are used for these type of comparisons can dramat- 
ically impact the conclusions we draw. One possible ex- 
planation for this difference is that the SZ selection of 
the Pll cluster sample implies that high-Lx, low M gas 
systems are under-represented in the Pll sample. For 
instance, the Pll sample may be picking out the tail of 
high M gas clusters in M10 at fixed Lx- Qualitatively, 
this is exactly analogous to the problem of the MCXC 
sub-sample of maxBCG galaxy clusters: starting from a 
sample selected on a noisy estimator (Lx), the interpre- 
tation of sub-samples selected using higher-quality mass 
proxies can be subject to important selection effects due 
to observable covariance. Quantitatively addressing this 
question, however, would require extensive simulations 
with covariant scatter between Lx, Y$z, and M gas , which 
is beyond the scope of this work. This does, however, il- 
lustrate that the type of naive comparisons that we are 
performing in this section and the next can in principle 
be subject to significant selection effects due to covari- 
ance between cluster observablcs. 

7.5. Comparison to Other Weak Lensing Masses 
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Fig. 8. — The mass offset \n(M\Lx) — haMm between the ex- 
pectation value of the cluster mass as per our scaling relations, and 
the mass quoted in V09 (black points), M10 (red points), and Pll 
(blue points). The lines show the mean mass for each of the various 
cluster sub-samples, including a low/high rcdshift partition of the 
V09 and Pll cluster samples. The length of each of the lines shows 
the rcdshift range used to define each cluster subsample. We plot 
a power-law model in 1 + z whenever the best fit power-law has 
a non-zero slope at more than 2<r significance. Otherwise, we plot 
the inverse-variance weighted mean. The square with error bars in 
the lower left hand corner is to illustrate the typical uncertainty in 
the offsets, which is entirely driven by the intrinsic scatter of the 
M-Lx relation. 



We now perform a similar exercise to the one above, 
but focusing on masses derived from weak lensing anal- 
yses instead. The LoCuSS collaboration^ have set out 
to accurately calibrate cluster scaling relations by pro- 
viding detailed weak lensing masses of a sample of ~ 100 
X-ray selected galaxy clusters. Their larges t compilation 
of wea k lensing masses to date is that from lOkabe et all 
(|2010l) . In a recent work, the Planck collaboration 
compared their X -ray derived masses to those from 
IQkabe et all (|2010[ ). finding that the latter are « 20% 
lower than the X-ray masses derived by the Planck team 
(| Planc k Colla boration! [20121 ) . This measurement stands 
in stark contrast to the solution advocated here, as we 
have argued that the Pll masses are too low by « 20%. 

In Figure HI we com pare our pre d icted masses from 
Lx to those quoted in Okabe e t al.l (120101 red points). 
There is a very large offset between our predictions and 
their masses, AlnM = 0.53 ± 0.07, nearly a factor of 
two. This value is somewhat higher than the expected 
offset of w 0.4 derived from the naive sum of the Pll 
of fset relative to our predictio ns, and the offset quoted 
in lPlanck Collaboration! (|2012D . 

As was demonstrated in paper II, the mass calibration 
used by Pll results in strong (> 4cr) tension between 
the observed X-ray luminosity function, and the cosmo- 
logical expectations for a WMAP7 cosmology. Further 
lowering the clus ter masses c a n onl y increase this ten- 
sion. Should the IQkabe et all d2010h masses be correct, 
the X-ray luminosity function in lBohringer et all ()2004l ) 
would need to be corrected upwards by a factor of 3 to 
4, implying the REFLEX catalog is « 70% incomplete. 
We find this level of incompleteness much too high to 



Fig. 9. — The mass offset In (M\Lx) — kiMm between the expec- 
tation value of the cluster mas s as per our scaling relations, and the 
weak lensing mass qu oted in IQkabe et alj (|201CH . red points) and 
IMahdavi et a l. (2008, black points). The lines show the inverse- 
variance weighted mean for each sample. The squares with error 
bars in the upper right hand corner illustrate the typical uncer- 
tainties in the offsets, which can be dominated by the weak lensing 
uncertainties. 



be plausible, so we expect there remains an unknown 
source o f systematic bias in the weak lens ing measure- 
ments of IQkabe et al.1 ([20101. Interestingly. IQkabe et al.l 
(|2010f) measured the CLASH cluster A 2261 as part of 
their Subaru weak lensing campaign. For this cluster, 
they find Af 50 o c = 8.14 ± 1.17 x 10 14 M E3 This mass 
is indeed biased low relative to the CLASH value, with 
the net mass offset being A In M = 0.50 ± 0.18. 

We also compa r e our overall mass calibration to that 
of lMahdavi et al.l (|2008l) , relying on luminosity estimates 
from M10 to perform the comparison. Th is cluster sam- 
ple is identical to that of iHoekstral ()2007l ). but updated 
with improved photometric rcdshifts for source galaxies. 
Our results are shown in Figure |9] using black points. 
The mean mass offset is A In M = 0.22 ± 12, with 
our pr edicted masses being larger than those of lHoekstral 
(|2007|) . Given the large uncertainties in the weak lens- 
ing masses, however, the two data sets are statistically 
consistent with each other. 

For complete ness, we have also se arche d for clusters 
in common to IMahdavi et~aTl (|2008l ) and IQkabe et all 
(|2010D . finding 6 such systems. One cluster, Abell 209, 
might be an outlier, but it is difficult to tell with such 
a small sample size. The mean mass offsets with and 
without Abell 209 are -0.27 ±0.29 and -0.37 ±0.18 re- 
spectively. Note that despite the large offsets between 
the two works, the scatter is such that based on these 6 
systems we cannot conclude that the two data sets are 
i nconsistent with each o ther. 

iHoekst r a et al.l (1201 If) have extended the cluster sam- 
ple from IMahdavi et al.l (|2008| ) to include low mass, high 
redshift systems. We assume a concentration parame- 
ter C500 = -R500C/-R2500C = 3 to convert their reported 
M2500C masses to Msoocj but note that our conclusions 
are sensitive to the assumed value of c§oo- The value 
we adopt here is consistent with X-ray observations (e.g. 
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Vikhlinin et al.l I2006T) an d numerical simulations (e.g. 
Bhattacharva et al.ll2011l) . A reasonable estimate for the 
systematic uncertainty in the mass corrections is ±10%. 
With our fiducial correction, we find that the mean mass 
offset between the two works is A In M = 13% ± 13%. 
Thus, o ur predicted m a sses a re fully consistent with 
those in iHoekstra et al.l (|201lD . We caution, however, 
that there is very large scatter in this comparison, re- 
flecting the large errors in the weak lensing masses: 12% 
of the systems in that work have a reported mass that 
is within la of zero, and a full 68% of the systems are 
within 2cr of zero. 

The final work we consider here is that of lUmetsu et al.l 
(|201lD , who combined magnification and shear measure- 
ments to derive constraints on five galaxy clusters: A 
1689, A 1703, A 370, CL0024+17, and RXJ1347-11. The 
inverse variance weighted mean offset relative to our pre- 
dictions (using luminosities in M10 or provided by Adam 
Mantz, private communication) is A In M = — 0.42±0.14, 
with our predicted masses being lower. This offset re- 
flects the fact that there are two systems with fairly 
large offsets: A370, with A In M = -0.85 ± 0.29, and 
CL0024+17, with A In M = -1.26 ± 0.30. The offset 
for CL0024+17 (L x = 2.3 ± 0.1) is so large that it is 
the only system we have encountered where our pre- 
dicted mass was more than 3<r away from the reported 
mass. Both A370 and CL0024+17 are truly exceptional 
systems. A3 70 was one of th e first cluster lenses ever 
identified (jSoucail et al. 1987a. b), and may be the most 
mass i ve lensing cluster in t he Universe (jBroadhurst et all 
120081: lUmetsu et all 120 111) . It also exhibits bimodal- 
ity i n its mass distribu tion, both in X-rays and lens- 
ing (jRichard et alJ|20To() . with the two components sep- 
arate d in redshift by ps 3,000 km/s (jde Filippis et al.l 
120051 ). CL0 024+17 is also kno wn as an exceptional lens- 
ing system (jZitrin et al.l 120091 ). and appears to be a re- 
cent cluster merger along the line of sight, leading to high 
weak lensing mass a nd a highly diffuse gas distribution 
(|Umetsu et aLll2010D . Given that these two clusters are 
known to be some of the most effective lenses in the Uni- 
verse, and the small sample size of the cluster sample, we 
are loathe to derive any conclusions from this compari- 
son. 

A final large compilation of homogeneously ana- 
lyzed weak l ensing masses of galaxy clusters is that of 
lOguri et al.l (|2012[ ). Unfortunately, these systems are 
typically not very X-ray bright: they were all first identi- 
fied in the optical. Consequently, X-ray luminosities for 
these systems are not easily available in the literature, 
preventing us from performing a comparison with that 
work. A summary of the mass offsets for both X-ray and 
weak lensing samples is presented in Table [6l As noted in 
the previous section, however, we emphasize that a more 
meaningful comparison would require a careful treatment 
of selection effects to avoid reaching biased conclusions 
because of intrinsic observable covariance. 

7.6. The Thermal SZ Power Spectrum 

As final check on our preferred set of scaling relations, 
we consider the implications of our Y^z—M relation on 
the thermal SZ (tSZ) spectrum. Specifically, we compare 
the predicted amplitude of the tSZ power spectrum de- 
rived from our preferred scaling relatio n to observations 
from the South Pole Telescope (SPT, iReichardt et al.l 



TABLE 6 

Mass Offset Between our Predicted Masses and Values 
from the Literature 



Work In (M\L X ) -In M Ht No. of Clusters 

in Sample 



V09 (z < 0.3) 0.11 ±0.04 49 

V09 (z > 0.3) 0.22 ±0.05 36 

M10 0.02 ±0.03 95 

Pll (z < 0.13) 0.09 ±0.05 24 

Pll (z > 0.13) 0.21 ±0.04 38 

Okabe et al. (2010) 0.53 ± 0.07 21 

Mahdavi et al*72uu8 ) 0.22 ±0.12 11 

Hoekstra et al. (20U) 0.13 ±0.13 25 

Umetsu et aT~?20lTT -0.42 ± 0.14 5 



a Mean mass offset between our predicted masses using Lx , and 
those reported in the literature. All means are inverse-variance 
weighted. 

120 111) . Following their convention, we characterize the 
amplitude of the thermal power spectrum in terms of 
its va lue relative to the fiducial m odel of IShaw et all 
pOloT ). That is, by definition, the IShaw et all (|2010f ) 
model corresponds to Atsz = 1- The value of the best 
fit amplitude A t sz to the SPT data depends on whether 
or not one allows for possible covariance with the Cos- 
mic Infra-red Background (CIB). The best fit values are 
Atsz = 0.70 ± 0.21 when assuming no covariance, and 
Atsz = 0.60 ±0.24 when marginalizing over any possible 
covariance. 

These values are to be compared with those predicted 
from our b est fit Ysz~M rela tion assuming self-similar 
evolution. IShaw et all (120101) found that the tSZ am- 
plitude for the lArnaud et all (120101 ) Y sz -M model is 
Atsz,Amaud = 127, in moderate tension with the data. 
The principal differe nce between our joint best fit Y§z~M 
relation and that of lArnaud et al.l (|2010( ) is the normal- 
ization, which differ in amplitude by Aa sz \ m = 0.465. 
Since A t sz oc Y" S 2 Z , the predicted amplitude A t sz from 
our joint Y$z~M scaling relation is A t sz = 0.50±0.10, in 
excellent agreement with the data. The offsets relative 
to the data with and without CIB covariance are 0.4a 
and 0.9cr respectively. 

8. SUMMARY AND DISCUSSION 

We have demonstrated that the bias-corrected R09 and 
V09 cluster scaling relations are fully self-consistent; not 
only are the observable-mass relations derived from op- 
tical and X-ray selected cluster catalogs consistent with 
one another, the optical and X-ray spatial abundance 
functions are consistent with each other and with cos- 
mological expectations. 

Table [4] summarizes our preferred set of scaling rela- 
tions. As usual, the scaling relations with mass are dom- 
inated by systematic uncertainties, which we estimate 
at the ±10% level on the cluster mass. Note that other 
than the joint V09±maxBCG L x -M and Y sz -M scaling 
relations, we have not attempted to combine all of our 
data sets: we have simply selected the cluster sample 
that probes each cluster scaling relation most directly. 
In principle, one could use the model from Appendix A 
in paper II to perform a full likelihood analysis on the full 
collection of data sets. Such an analysis would recover 
not only constraints on the individual scaling relations, 
but also on the various correlation coefficients. Such an 
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analysis, however, is beyond the scope of this work. In 
fact, we think of our work as a necessary precursor to 
such an analysis: given the surprising results presented 
in Pll-opt concerning the 3^z _ -^200 relation, it was nec- 
essary to investigate whether there even existed a model 
that could give a good fit to all of the data currently 
available. Otherwise, it would make little sense to at- 
tempt to combine these various cluster samples. 

Concerning the i^z _ -^200 relation, there are several dif- 
ferences between our analysis and that of Pll-opt. First, 
all of our scaling relations are self-consistently propa- 
gated using the probabilistic model descr ibed in Ap- 
pend ix A of paper II (see also Appenidx C in lWhite et al.l 
I2010T) . or through expl icit Monte Car l o met hods. In addi- 
tion, Pll-opt used the lArnaud et al.l (|2010D model for the 
Ysz-M scaling relation when predicting the Ysz-A^oo 
relation, whereas we explicitly constrained the Ysz-M 
relation for each of the three X-ray data sets we con- 
sidered, and used this as the basis of our analysis. It 
is worth noting that because the Ysz/Yy ratio in Pll 
at z = 0.23 is higher than that derived in lArnaud et al.l 
(|2010f ) . the tension between our Pll(z=0.23) prediction 
and the Pll-opt data is even stronger than that in the 
original Pll-opt paper. 

The tension between our own prediction for the 
Pll(z=0.23) Y sz -N 2 oo relation and the Pll-opt data 
has three important contributions, all involving mass. 
First, the Pll masses appear to be biased low relative to 
those from other X-ray works, particularly V09 and M10. 
In addition, we expect the masses in these two works 
should be further corrected due to hydrostatic bias, at 
the ps 5% — 15% level (see below). Finally, we had to 
lower the maxBCG weak lensing masses by 10%, a cor- 
rection that is at least partly sourced by intrinsic covari- 
ance between weak lensing mass and cluster richness at 
fixed mass. We note, however, that this systematic shift 
is also within the expected systematic error due to pho- 
tometric redshift uncertainties and cluster miscentering. 
Shifting these values around by up to ±10% is accept- 
able, but larger shifts start to introduce tension in var- 
ious places: either the observable-mass relations from 
optical and X-ray catalogs will fail to be consistent with 
one another, or the spatial abundance of galaxy clusters 
become inconsistent with cosmological expectations. 

Throughout this work, we adopted a 15% hydrostatic 
bias (at fixed aperture, 21% total bias) when model- 
ing the V09 bias-corrected scaling relations. This value 
is well supported by s imulation results spanning more 
than twenty years (e.g. lEvra rd 1990; lEyrard et al"]|1996t 
Cer3ll997tlRasia et aUl2006l: iNagai et al.ll2007HLau et al l 
2009HBattaglia et al.ll2011l : lRasia et al.ll2012f ). Curiously, 
when we directly compare the cluster masses derived 
from our preferred scaling relation to those quoted in 
V09, the net offset at low and high redshifts was only 
ps 10% and 20% respectively, suggesting an overall total 
hydrostatic bias close to ps 10% at fixed aperture, also 
well within the range found in numerical simulations. 

If hydrostatic bias were in fact significantly higher, 
say ps 30% or higher as seen in some simulations, then 
the agreement between the V09 and maxBCG data sets 
would be a fortuitous coincidence; such a large hydro- 
static bias would have to be cancelled by some other un- 
known bias in the V09 data that tends to increase cluster 
masses. In such a scenario, the Pll(z=0.23) data would 



in fact be consistent or close to consistent with maxBCG 
observations. Note, however, that the puzzle of the high 
Ysz/Yx ratio from the Pll(z=0.23) data noted in pa- 
per I would remain. Since this latter scenario requires 
a conspiracy of errors, and still leaves an open question 
unanswered, we much prefer our proposed solution. As 
for the weak lensing optical masses, the bias in the R09 
scaling relation may be explained as a combination of 
Malmquist bias due to covariance between weak lensing 
masses and A^oo- 

In summary, we believe we have been able to present 
a solution to the puzzle posed in Pll-opt: that is, we 
have constructed a set of cluster scaling relations that 
satisfies all the internal requirements for self-consistency. 
These conditions are numerous and non-trivial. In fact, 
we have seen these conditions tightly constrain deviations 
from our proposed solution: neither the mass scale of 
the maxBCG or V09 galaxy clusters can be altered by 
much more than « 10% without introducing tension in 
either some cluster scaling relation, or with cosmological 
expectations for the cluster spatial abundance. We note 
too that our recovered Ysz~M scaling relation is fully 
consistent with the amplitude of the thermal SZ eff ects 
as measur ed in the SPT data (|Reichardt et al.ll2011l ). 

Having derived our preferred Lx~M and Ysz~M re- 
lations from these arguments, we have used them to es- 
timate cluster masses for several cluster samples in the 
literature, and to predict cluster masses for each of the 
z < 0.4 CLASH systems. Despite the large statistical 
uncertainty in our predicted cluster masses — which arc 
dominated by the intrinsic scatter in the M~Lx rela- 
tion — we can average over many clusters to test the 
overall level of systematic mass offset between that of 
our favored Lx~M relation, and the masses reported 
in the literature. These mass offsets are summarized 
in Table |6] We caution, however, that these mass off- 
sets can be subject to important selection effects due 
to intrinsic covariance between cluster observablcs. Im- 
portantly, we were able to also derive masses from the 
SZ observations in Pll for the two galaxy clusters that 
have been publis hed so far by t he CLASH collabora- 
tion, Abell 2261 dCoe et al.l 120121) and MACS J1205-08 
(|Umetsu et al.ll2012l ). In both cases, we find excellent 
agreement between our predicted masses and the CLASH 
results. We emphasize that this agreement if highly non- 
trivial: both our predicted masses and the results from 
the CLASH collaboration quote statistical uncertainties 
of order ps 10% - 15%. 

At this time, the largest difference between the V09 
and maxBCG catalogs involves the amplitude of the SZ 
signal, see for instance Figures [2l [6] and [TJ in all cases, 
boosting the Pll-opt Ygz measurement by ps 20% — par- 
ticularly for low mass objects — would result in better 
agreement with the V09 data set and the Pll-X mea- 
surements. We emphasize, however, that such a boost 
is not necessary at this point; the current data set is 
self-consistent in a statistical sense. Interestingly, a 20% 
boost of the YSZ-A200 amplitude corresponds to less than 
a 2a (statistical only) shift in the Ysz-A^oo relation re- 
covered from the Pll-opt data. With deeper Planck 
data, the statistical error in the YSZ-A200 relation will 
shrink significantly; should the amplitude not shift up- 
wards as the statistical precision of the measurements 
improves, this new data could very well rule out our pro- 
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posed solution for ~ 10 14 M© galaxy clusters. 
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